KMID : 1137820070280030355
|
|
ÀÇ°øÇÐȸÁö 2007 Volume.28 No. 3 p.355 ~ p.362
|
|
Prediction of Diabetic Nephropathy from Diabetes Dataset Using Feature Selection Methods and SVM Learning
|
|
Cho Baek-Hwan
Lee Jong-Shill Chee Young-Joon Kim Kwang-Won Kim In-Young Kim Sun-Il
|
|
Abstract
|
|
|
Diabetes mellitus can cause devastating complications, which often result in disability and death, and diabetic nephropathy is a leading cause of death in people with diabetes. In this study, we tried to predict the onset of diabetic nephropathy from an irregular and unbalanced diabetic dataset. We collected clinical data from 292 patients with type 2 diabetes and performed preprocessing to extract 184 features to resolve the irregularity of the dataset. We compared several feature selection methods, such as ReliefF and sensitivity analysis, to remove redundant features and improve the classification performance. We also compared learning methods with support vector machine, such as equal cost learning and cost-sensitive learning to tackle the unbalanced problem in the dataset. The best classifier with the 39 selected features gave 0.969 of the area under the curve by receiver operation characteristics analysis, which represents that our method can predict diabetic nephropathy with high generalization performance from an irregular and unbalanced dataset, and physicians can benefit from it for predicting diabetic nephropathy.
|
|
KEYWORD
|
|
diabetic nephropathy, feature selection, support vector machine, cost-sensitive learning
|
|
FullTexts / Linksout information
|
|
|
|
Listed journal information
|
|
|